Analyse appartements - Shapash

Interpretation des predictions appartements

Project_Information

Author : VotreNom

Description : Rapport Shapash pour appartements

Project_Name : Analyse extratrees_appart


Model analysis

Model used : ExtraTreesRegressor

Library : sklearn.ensemble._forest

Library version : 1.5.2

Model parameters :

Parameter key Parameter value
estimator ExtraTreeRegressor()
n_estimators 254
estimator_params ('criterion', 'max_depth', 'min_samples_split', 'min_samples_leaf', 'min_weight_fraction_leaf', 'max_features', 'max_leaf_nodes', 'min_impurity_decrease', 'random_state', 'ccp_alpha', 'monotonic_cst')
bootstrap False
oob_score False
n_jobs None
random_state None
verbose 0
warm_start False
class_weight None
max_samples None
criterion squared_error
max_depth 20
min_samples_split 5
Parameter key Parameter value
min_samples_leaf 1
min_weight_fraction_leaf 0.0
max_features 1.0
max_leaf_nodes None
min_impurity_decrease 0.0
ccp_alpha 0.0
monotonic_cst None
feature_names_in_ ['etage' 'surface' 'nb_pieces' 'balcon' 'eau' 'bain' 'dpeL' 'dpeC' 'mapCoordonneesLatitude' 'mapCoordonneesLongitude' 'annonce_exclusive' 'nb_etages' 'places_parking' 'cave' 'ges_class' 'annee_construction' 'nb_toilettes' 'ascenseur' 'nb_logements_copro' 'charges_copro' 'chauffage_energie' 'chauffage_systeme'...
n_features_in_ 56
_n_samples 11035
n_outputs_ 1
_n_samples_bootstrap None
estimator_ ExtraTreeRegressor()
estimators_ [ExtraTreeRegressor(max_depth=20, min_samples_split=5, random_state=1936962966), ExtraTreeRegressor(max_depth=20, min_samples_split=5, random_state=2069891226), ExtraTreeRegressor(max_depth=20, min_samples_split=5, random_state=2018050919), ExtraTreeRegressor(max_depth=20, min_samples_split=5,...

Dataset analysis

Global analysis

Training dataset Prediction dataset
number of features NaN 56
number of observations NaN 2,759
missing values NaN 0
% missing values NaN 0

Univariate analysis

etage - Numeric

Prediction dataset
count 2,759
mean -0.0135
std 0.976
min -0.514
25% -0.514
50% -0.155
75% 0.205
max 17.5

Target analysis

prix_m2_vente - Numeric

Prediction dataset
count 2,759
mean 2,550
std 1,070
min 216
25% 1,710
50% 2,440
75% 3,300
max 7,440

Multivariate analysis


Model explainability

Note : the explainability graphs were generated using the test set only.

Global feature importance plot

Features contribution plots

etage -


Model performance

Univariate analysis of target variable

prix_m2_vente - Numeric

True values Prediction values
count 2,759 2,759
mean 2,550 2,560
std 1,070 891
min 216 481
25% 1,710 1,850
50% 2,440 2,440
75% 3,300 3,180
max 7,440 7,020

Metrics

MAE : 347

R2 : 0.778

MSE : 256,000

MAPE : 0.172

MdAE : 234

Explained Variance : 0.778